# Panoptic Segmentation

Mask2former Swin Base IN21k Cityscapes Semantic

A Mask2Former model with a Swin-Base backbone pretrained on ImageNet-21k, fine-tuned for semantic segmentation on the Cityscapes dataset. The architecture unifies instance, semantic, and panoptic segmentation.

Image Segmentation

Mask2former Swin Tiny Cityscapes Semantic

Mask2Former is a unified image segmentation framework capable of handling instance segmentation, semantic segmentation, and panoptic segmentation tasks. This model is based on the Swin-Tiny backbone network and has been fine-tuned for semantic segmentation on the Cityscapes dataset.

Image Segmentation

Mask2former Swin Small Cityscapes Semantic

A small version of Mask2Former based on the Swin backbone network, trained for semantic segmentation on the Cityscapes dataset.

Image Segmentation

Mask2former Swin Base IN21k Cityscapes Panoptic

Mask2Former is a general-purpose image segmentation model based on Transformer architecture, capable of handling instance segmentation, semantic segmentation, and panoptic segmentation tasks.

Image Segmentation

Mask2former Swin Tiny Ade Semantic

Mask2Former is a unified image segmentation model based on Transformer, capable of handling instance segmentation, semantic segmentation, and panoptic segmentation tasks.

Image Segmentation

Mask2former Swin Large Ade Semantic

A large-scale version of Mask2Former based on the Swin backbone network, trained on the ADE20k semantic segmentation dataset and using a unified paradigm for image segmentation tasks.

Image Segmentation

Mask2former Swin Base IN21k Ade Semantic

Mask2Former is a universal image segmentation model capable of handling instance segmentation, semantic segmentation, and panoptic segmentation tasks by predicting a set of masks and their corresponding labels.

Image Segmentation
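The description above says the model predicts a set of masks with corresponding labels. The following is a minimal NumPy sketch of how such a set could be merged into a per-pixel semantic map, assuming each query emits class logits (with a trailing "no object" column) and one mask logit map. The function name `masks_to_semantic_map` and the toy shapes are illustrative assumptions, not part of any listed model's API.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def masks_to_semantic_map(class_logits, mask_logits):
    """Merge per-query predictions into a semantic map (hypothetical helper).

    class_logits: (Q, C + 1), last column is the "no object" class.
    mask_logits:  (Q, H, W), one mask logit map per query.
    Returns an (H, W) array of class ids.
    """
    class_probs = softmax(class_logits, axis=-1)[:, :-1]  # drop "no object"
    mask_probs = sigmoid(mask_logits)
    # Weight each query's mask by its class probabilities and sum over queries.
    per_class_scores = np.einsum("qc,qhw->chw", class_probs, mask_probs)
    return per_class_scores.argmax(axis=0)

# Toy input: 2 queries, 2 classes (+ "no object"), a 2x4 image.
class_logits = np.array([[5.0, 0.0, 0.0],
                         [0.0, 5.0, 0.0]])
mask_logits = np.full((2, 2, 4), -6.0)
mask_logits[0, :, :2] = 6.0   # query 0 covers the left half
mask_logits[1, :, 2:] = 6.0   # query 1 covers the right half

semantic_map = masks_to_semantic_map(class_logits, mask_logits)
print(semantic_map)
```

In this toy case the left half of the image is assigned class 0 and the right half class 1, because each query is confident about one class and one region.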

Mask2former Swin Base Ade Semantic

A general-purpose image segmentation model trained on the ADE20k dataset, using a unified framework to handle instance/semantic/panoptic segmentation tasks

Image Segmentation

Mask2former Swin Large Ade Panoptic

Mask2Former model trained on the ADE20k panoptic segmentation dataset using a Swin large backbone network, employing a unified paradigm to handle instance segmentation, semantic segmentation, and panoptic segmentation tasks.

Image Segmentation

Mask2former Swin Large Mapillary Vistas Semantic

A large-scale Mask2Former model based on the Swin backbone network, trained for semantic segmentation on the Mapillary Vistas dataset. The architecture unifies instance, semantic, and panoptic segmentation.

Image Segmentation

Mask2former Swin Large Cityscapes Semantic

A large-scale Mask2Former model based on the Swin backbone network, specifically trained for Cityscapes semantic segmentation tasks, adopting a unified architecture for various image segmentation tasks.

Image Segmentation

Mask2former Swin Small Cityscapes Panoptic

A compact Mask2Former model based on the Swin backbone network, trained for panoptic segmentation on the Cityscapes dataset.

Image Segmentation

Mask2former Swin Large Cityscapes Panoptic

A large-scale Mask2Former model based on the Swin backbone network, trained for panoptic segmentation on the Cityscapes dataset.

Image Segmentation

Mask2former Swin Tiny Cityscapes Panoptic

A Mask2Former model based on the Swin-Tiny backbone, trained for panoptic segmentation on the Cityscapes dataset.

Image Segmentation

Mask2former Swin Tiny Coco Panoptic

Mask2Former is a Transformer-based unified image segmentation model supporting instance, semantic, and panoptic segmentation. It uses a masked attention mechanism, which restricts each query's cross-attention to the region of its predicted mask.

Image Segmentation
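The masked attention mentioned for this model can be illustrated with a small NumPy sketch: each query attends only to pixel positions where its mask prediction from the previous decoder layer is foreground. This is a simplified single-head version without learned projections; the function name `masked_cross_attention`, the 0.5 threshold argument, and the toy data are assumptions for illustration, not the released implementation.

```python
import numpy as np

def masked_cross_attention(queries, keys, values, prev_mask_probs, threshold=0.5):
    """Single-head cross-attention restricted by a predicted mask (sketch).

    queries:         (Q, D) query embeddings.
    keys, values:    (N, D) flattened image features (N = H * W).
    prev_mask_probs: (Q, N) mask probabilities from the previous layer.
    Returns (output, weights) with shapes (Q, D) and (Q, N).
    """
    scores = queries @ keys.T / np.sqrt(queries.shape[-1])
    allowed = prev_mask_probs >= threshold
    # A query with an empty mask would have nothing to attend to,
    # so in this sketch it falls back to attending everywhere.
    empty = ~allowed.any(axis=-1)
    allowed[empty] = True
    scores = np.where(allowed, scores, -np.inf)
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)  # exp(-inf) == 0 for masked-out positions
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ values, weights

rng = np.random.default_rng(0)
queries = rng.normal(size=(2, 2))
keys = rng.normal(size=(4, 2))
values = rng.normal(size=(4, 2))
prev_mask_probs = np.array([[0.9, 0.8, 0.1, 0.2],   # query 0: first two pixels
                            [0.1, 0.1, 0.1, 0.1]])  # query 1: empty mask

output, weights = masked_cross_attention(queries, keys, values, prev_mask_probs)
print(weights.round(3))
```

Query 0 places zero weight on the last two positions, while query 1, whose mask is empty, attends to all four.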

Mask2former Swin Small Coco Panoptic

A small-scale version of Mask2Former based on the Swin backbone network, trained for panoptic segmentation on the COCO dataset.

Image Segmentation

Mask2former Swin Large Coco Panoptic

A large-scale version of Mask2Former based on the Swin backbone network, specifically trained for panoptic segmentation tasks on the COCO dataset

Image Segmentation

Mask2former Swin Base Coco Panoptic

The Mask2Former model based on the Swin backbone network, trained on the COCO panoptic segmentation dataset, adopts a unified paradigm to handle instance segmentation, semantic segmentation, and panoptic segmentation tasks.

Image Segmentation

Oneformer Coco Dinat Large

OneFormer is a single unified Transformer architecture for image segmentation that supports semantic, instance, and panoptic segmentation. This version uses a DiNAT-Large backbone and is trained on the COCO dataset.

Image Segmentation

Oneformer Ade20k Dinat Large

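OneFormer is presented as the first multi-task universal image segmentation framework, supporting semantic, instance, and panoptic segmentation with a single model. This version uses a DiNAT-Large backbone and is trained on the ADE20k dataset.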

Image Segmentation

Detr Resnet 50 Panoptic

DETR is an end-to-end object detection model based on the Transformer architecture. This version uses ResNet-50 as the backbone network, is trained on the COCO dataset, and supports object detection and panoptic segmentation.

Image Segmentation

Maskformer Swin Large Coco

A large-scale MaskFormer model based on the Swin backbone network, trained on the COCO dataset. The architecture unifies instance, semantic, and panoptic segmentation.

Image Segmentation

Maskformer Swin Small Coco

A small MaskFormer model based on the Swin backbone network, trained on the COCO dataset for panoptic segmentation tasks.

Image Segmentation

Maskformer Swin Base Coco

A base-size MaskFormer model based on the Swin backbone network, trained on the COCO dataset for panoptic segmentation. The architecture unifies instance, semantic, and panoptic segmentation.

Image Segmentation

Maskformer Swin Tiny Coco

A tiny MaskFormer model based on the Swin backbone network, trained on the COCO dataset for panoptic segmentation, using a unified paradigm to handle instance, semantic, and panoptic segmentation.

Image Segmentation

© 2025 AIbase